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Technical Field 

5 

The present invention relates to an apparatus and 
method for adapting a 2D or 3D stereoscopic video signal; 
and, more particularly to an apparatus and method for 
adapting a 2D or 3D stereoscopic video signal according to 
10 user characteristics and user terminal characteristics and 
a computer-readable recording medium on which a program 
for executing the method is recorded. 

Background Art 

15 

The Moving Picture Experts Group (MPEG) suggests a new 

standard working item, a Digital Item Adaptation ( DIA) . 

Digital Item (DI) is a. structured digital object with a 

standardized representation, identification and metadata, 
20 and DIA means a process for generating adapted DI by 

modifying the DI in a resource adaptation engine and/or 

descriptor adaptation engine. 

Here, the resource means an asset that can be 

identified individually, such as audio or video clips, and 
25 image or textual asset. The resource may stand for a 

physical object, too. Descriptor means information related 
•to the components or items of a DI, such as metadata. Also, 

a user is meant to include all the producer, rightful 

person, distributor and consumer of the DI. Media resource 
30 means a content that can be expressed digitally directly. 

In this specification, the term 'content' is used in the 

same meaning as DI, media resource and resource. 

While two-dimensional (2D) video has been a general 

media so far, three-dimensional (3D) video has been also 
35 introduced in the field of information and 

telecommunications. The stereoscopic image and video are 
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easily found at many Internet sites, DVD titles, etc. 
Following this situation, MPEG has been interested in the 
stereoscopic video processing. The compression scheme of 
the stereoscopic video has been standardized in MPEG-2, 
5 i.e., "Final Text of 12818-2/AMD3 (MPEG-2 multiview 
profile)" at International Standard 

Organization/International Electrotechnical committee 
(ISO/IEC) JTC1/SC29/WG11. The MPEG-2 multiview profile 
(MVP) was defined in 199 6 as an amendment to the MPEG-2 

10 standard with the main application area being stereoscopic 
TV. The MVP extends the well-known hybrid coding towards 
exploitation of inter-viewchannel redundancies by 
implicitly defining disparity-compensated prediction. The 
main new elements are the definition of usage of a temporal 

15 scalability (TS) mode for multi-camera sequences, and the 
definition of acquisition parameters in an MPEG-2 syntax. 
The TS mode was originally developed to allow the joint 
encoding of base layer stream having a low frame rate and 
an enhancement layer stream having additional video frames. 

20 If both streams are available, decoded video can be 
reproduced with full frame rate. In the TS mode, temporal 
prediction of enhancement layer macroblocks can be 
performed either from a base layer frame, or from the 
recently reconstructed enhancement layer frame. 

25 In general, the stereoscopic video is produced using a 

stereoscopic camera with a pair of left and right camera. 
The stereoscopic video is stored or transmitted to the user. 
Unlike the stereoscopic video, the 3D stereoscopic 
conversion of 2D video (2D/3D stereoscopic video 

30 conversion) makes it possible for users to watch 3D 
stereoscopic video from ordinary 2D video data. For 
instance, users can enjoy 3D stereoscopic movies from TV, 
VCD, DVD, etc. Unlike general stereoscopic images acquired 
from a stereoscopic camera, an essential difference is that 

35 the stereoscopic conversion is to generate a stereoscopic 
image from a single 2D image. As well, the 2D video can be 
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extracted from the 3D stereoscopic video acquired from a 
stereoscopic camera (3D stereoscopic/2D video conversion). 

Conventional technologies have a problem that they 
cannot provide a single- source multi-use environment where 
5 one video content is adapted to and used in different usage 
environments by using video content usage information, i.e., 
user characteristics, natural environment of a user, and 
capability of a user terminal. 

Here, 'a single source' denotes a content generated in 

10 a multimedia source, and 'multi-use' means various user 
terminals having diverse usage environments that consume 
the 'single source' adaptively to their usage environment. 

Single-source multi-use is advantageous because it can 
provide diversified contents with only one content by 

15 adapting the content to different usage environments, and 
further, it can reduce the network bandwidth efficiently 
when it provides the single source adapted to the various 
usage environments . 

Therefore, the content provider can save unnecessary 

20 cost for producing and transmitting a plurality of contents 
to match various usage environments. On the other hand, 
the content consumers can be provided with a video content 
optimized for their diverse usage environments. 

However, conventional technologies do not take the 

25 advantage of single-source multi-user. That is, the 
conventional technologies transmit video contents 
indiscriminately without considering the usage environment, 
such as user characteristics and user terminal 
characteristics. The user terminal having a video player 

30 application consumes the video content with a format 
unchanged as received from the multimedia source. 
Therefore, the conventional technologies can not support 
the single-source multi-use environment. 

If a multimedia source provides a multimedia content 

35 in consideration of various usage environments to overcome 
the problems of the conventional technologies and support 
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the single-source multi-use environment, much load is 
applied to the generation and transmission of the content. 

Disclosure of Invention 

5 

It is, therefore, an object of the present invention 
to provide an apparatus and method for adapting a video 
content to usage environment by using information pre- 
describing the usage environment of a user terminal that 

10 consumes the video content. 

In accordance with one aspect of the present invention, 
there is provided an apparatus for adapting a two- 
dimensional (2D) or three-dimensional (3D) stereoscopic 
video signal for single-source multi-use, including: a 

15 video usage environment information managing unit for 
acquiring, describing and managing user characteristic 
information from a user terminal; and a video adaptation 
unit for adapting the video signal to the video usage 
environment information to generate an adapted 2D video 

20 signal or a 3D stereoscopic video signal and outputting the 
adapted video signal to the user terminal. 

In accordance with another aspect of the present 
invention, there is provided an apparatus for adapting a 2D 
video signal or a 3D stereoscopic video signal for single- 

25 source multi-use, including: a video usage environment 
information managing unit for acquiring, describing and 
managing user terminal characteristic information from a 
user terminal; and a video adaptation unit for adapting 
the video signal to the video usage environment information 

30 to generate an adapted 2D video signal or 3D stereoscopic 
video signal and outputting the adapted video signal to the 
user terminal. 

In accordance with one aspect of the present invention, 
there is provided a method for adapting a 2D video signal 
35 or a 3D stereoscopic video signal for single-source multi- 
use, including the steps of: a) acquiring, describing and 
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managing user characteristic information from a user 
terminal; and b) adapting the video signal to the video 
usage environment information to generate an adapted 2D 
video signal or a 3D stereoscopic video signal and 
5 outputting the adapted video signal to the user terminal. 

In accordance with another aspect of the present 
invention, there is provided a method for adapting a 2D 
video signal or a 3D stereoscopic video signal for single- 
source multi-use, including the steps of: a) acquiring, 

10 describing and managing user terminal characteristic 
information from a user terminal; and b) adapting the 
video signal to the video usage environment information to 
generate an adapted 2D video signal or 3D stereoscopic 
video signal and outputting the adapted video signal to the 

15 user terminal. 

In accordance with one aspect of the present invention, 
there is provided a computer-readable recording medium for 
recording a program that implements a method for adapting a 
2D video signal or a 3D stereoscopic video signal for 

20 single-source multi-use, the method including the steps of: 
a) acquiring, describing and managing user 
characteristic information from a user terminal; and b) 
adapting the video signal to the video usage environment 
information to generate an adapted 2D video signal or 3D 

25 stereoscopic video signal and outputting the adapted video 
signal to the user terminal. 

In accordance with another aspect of the present 
invention, there is provided a computer-readable recording 
medium for recording a program that implements a method for 

30 adapting a 2D video signal or a 3D stereoscopic video 
signal for single-source multi-use, the method including 
the steps of: a) acquiring, describing and managing user 
terminal characteristic information from a user terminal; 
and b) adapting the video signal to the video usage 

35 environment information to generate ah adapted 2D video 
signal or 3D stereoscopic video signal and outputting the 
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adapted video signal to the user terminal. 

Brief Description of Drawings 

5 The above and other objects and features of the 

present invention will become apparent from the following 
description of the preferred embodiments given in 
conjunction with the accompanying drawings, in which: 

Fi 9» L -lp a block diagram illustrating a user terminal 
10 provided with a video adaptation apparatus in accordance 
with an embodiment of the present invention; 

Fig ^ 2 is a block diagram describing a user terminal 
that can be embodied by using the video adaptation 
apparatus of Fig. 1 in accordance with an embodiment of the 
15 present invention; 

Fig . 3 is a flowchart illustrating a video adaptation 
process performed in the video adaptation apparatus of Fig. 
1; Fig. 4 is a flowchart depicting the adaptation 
process of' Fig~. 3; 
20 Fig. 5 is a flowchart showing an adaptation process of 

2D video signal and 3D stereoscopic video signal in 
accordance with a preferred embodiment of the present 
invention; 

Fig \__!- is an exem P lar Y diagram depicting parallaxes in 
25 accordance with the present invention; 

Fig. 7 is an exemplary diagram depicting a range of 
depth in accordance with the present invention; and 

Figs . _8A to 8C are exemplary diagrams illustrating 
rendering methods of 3D stereoscopic video signal in 
30 accordance with the present invention. 

Best Mode for Carrying Out the Invention 

Other objects and aspects of the invention will become 
35 apparent from the following description of the embodiments 
with reference to the accompanying drawings, which is set 
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forth hereinafter. 

Following description exemplifies only the principles 
of the present invention. Even if they are not described 
or illustrated clearly in the present specification, one of 
5 ordinary skill in the art can embody the principles of the 
present invention and invent various apparatuses within the 
concept and scope of the present invention. 

The conditional terms and embodiments presented in the 
present specification are intended only to make understood 
10 the concept of the present invention, and they are not 
limited to the embodiments and conditions mentioned in the 
specification. 

In addition, all the detailed description on the 
principles, viewpoints and embodiments and particular 

15 • embodiments of the present invention should be understood 
to include structural and functional equivalents to them. 
The equivalents include not only the currently known 
equivalents but also those to be developed in future, that 
is, all devices invented to perform the same function, 

20 regardless of their structures. 

For example, block diagrams of the present invention 
should be understood to show a conceptual viewpoint of an 
exemplary circuit that embodies the principles of the 
present invention. Similarly, all the flowcharts, state 

25 conversion diagrams, pseudo codes, and the like can be 
expressed substantially in a computer-readable recording 
media, and whether or not a computer or a processor is 
described in the specification distinctively, they should 
be understood to express a process operated by a computer 

30 or a processor. 

The functions of various devices illustrated in the 
drawings including a functional block expressed as a 
processor or a similar concept can be provided not only by 
using dedicated hardware, but also by using hardware 
35 capable of running proper software. When the function is 
provided by a processor, the provider may be a single 
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dedicated processor, single shared processor, or a 
plurality of individual processors, part of which can be 
shared. 

The apparent use of a term, ' processor ', 'control' or 
similar concept, should not be understood to exclusively 
refer to a piece of hardware capable of running software, 
but should be understood to include a digital signal 
processor (DSP), hardware, and ROM, RAM and non-volatile 
memory for storing software, implicatively . Other known 
and commonly used hardware may be included therein, too. 

In the claims of the present specification, an element 
expressed as a -means- for performing a function described 
in the detailed description is intended to include all 
methods for performing the function including all formats 
of software, such as a combination of circuits that 
performs the function, firmware /microcode, and the like. 
To perform the intended function, the element is cooperated 
with a proper circuit for performing the software. The 
claimed invention includes diverse means for performing 
particular functions, and the means are connected with each 
other in a method requested in the claims. Therefore, any 
means that can provide the function should be understood to 
be an equivalent to- what is figured out from the present 
specification . 

Other objects and aspects of the invention will become 
apparent from the following description of the embodiments 
with reference to the accompanying drawings, which is set 
forth hereinafter. The same reference numeral is given to 
the same element, although the element appears in different 
drawings. In addition, if further detailed description on 
the related prior arts is thought to blur the point of the 
present invention, the description is omitted. Hereafter, 
preferred embodiments of the present invention will be 
described in detail. 

Fig. 1 is a block diagram illustrating a user terminal 
provided with a video adaptation apparatus in accordance 
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10 



with an embodiment of the present invention. Referring to 
Fig. 1, the video adaptation apparatus 10 0 of the 
embodiment of the present invention includes a video 
adaptation portion 103 and a video usage environment 
information managing portion 107. Each of the video 
adaptation portion 103 and the video usage environment 
information managing portion 107 can be provided to a video 
processing system independently from each other. 

The video processing system includes laptops, 
notebooks, desktops , 9 workstations, mainframe computers and 
other types of computers. Data processing or signal 
processing systems, such as Personal Digital Assistant 
(PDA) and wireless communication mobile stations, are 
included in the video processing system. 
15 Th ^ video system may be any one arbitrary selected 

from the nodes that form a network path, e.g., a multimedia 
source node system, a multimedia relay node system, and an 
end user terminal. 

The end user terminal includes a video player, such as 
20 Windows Media Player and Real Player. 

For example, if the video adaptation apparatus 100 is 
mounted on the multimedia source node system and operated, 
it receives pre-described information on the usage 
environment in which the video content is consumed, adapts 
25 the video content to the usage environment, and transmits 
the adapted content to the end user terminal. 

With respect to the video encoding process, a process 
of the video adaptation apparatus 100 processing video data, 
the International Organization for Standardization/ 
30 International Electrotechnical Committee (ISO/IEC) standard 
document of the technical committee of the ISO/IEC may be 
included as part of the present specification as far as it 
is helpful in' describing the functions and operations of 
the elements in the embodiment of the present invention. 
35 A video data source portion 101 receives video data 

generated in a multimedia source. The video data source 
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10 



portion 101 may be included in the multimedia source node 
system, or a multimedia relay node system that receives 
video data transmitted from the multimedia source node 
system through a wired/wireless network, or in the end user 
terminal. 

The video adaptation portion 103 receives video data 
from the video data source portion 101 and adapts the video 
data to the usage environment, e.g., user characteristics 
and user terminal characteristics, by using the usage 
environment information pre-described by the video usage 
environment information managing portion 107. 

The video usage environment information managing 
portion 107 collects information from a user and a user 
terminal, and then describes and manages usage environment 
15 information in advance. 

The video content/metadata output portion 105 outputs 
video data adapted by the video adaptation portion 10 3. 
The outputted video data may be transmitted to a video 
player of the end user terminal, or to a multimedia relay 
20 node system or the end user terminal through a 
wired/wireless network. 

Fig. 2 is a block diagram describing a user terminal 
that can be embodied by using the video adaptation 
apparatus of Fig. 1 in accordance with an embodiment of the 
25 present invention. As illustrated in the drawing, - the 
video data source portion 101 includes video metadata 201 
and a video content 2 03. 

The video data source portion 101 collects video 
contents and metadata from a multimedia source and stores 
30 them. Here, the video content and the metadata are 
obtained from terrestrial, satellite or cable TV signal, 
network such as the Internet, or a recording medium such as 
a VCR, CD or DVD. The video content also includes two- 
dimensional (2D) video or three-dimensional (3D) 
stereoscopic video transmitted in the form of streaming or 
broadcasting. 

10 



35 



WO 2004/008768 



PCTVKR2003/001411 



The video metadata 201 is a description data related 
to video media information, such as the encoding method of 
the video content , size of file, bit-rate r frame/second and 
resolution, and corresponding content information such as, 
title, author, produced time and place, genre and rating of 
video content. The video metadata can be defined and 
described based on extensible Markup Language (XML) schema. 

The video usage environment information managing 
portion 107 includes a user characteristic information 
managing unit 207, a user characteristic information input 
unit 217, a video terminal characteristic information 
managing unit 2 09 and a video terminal characteristic 
information input unit 219. 

The user characteristic information managing unit 2 07 
receives information of user characteristics, such as depth 
and parallax of 3D stereoscopic video content in case of 
2D/3D video conversion, or left and right inter video in 
case of 3D/2D video conversion according to preference or 
favor of user from the user terminal through the user 
characteristic information input unit 217, and manages the 
information of user characteristics. The inputted user 
characteristic information is managed in a language that 
can be readable mechanically, for example, an XML format. 

The video terminal characteristic information 
managing unit 2 09 receives terminal characteristic 
information from the video terminal characteristic 
information input unit 219 and manages the terminal 
characteristic information. The terminal characteristic 
information is managed in a language that can be readable 
mechanically, for example, an XML format. 

The video terminal characteristic information input 
unit 219 transmits the terminal characteristic information 
that is set in advance or inputted by the user to the video 
terminal characteristic information managing unit 2 09. The 
video usage environment information managing portion 107 
receives user terminal characteristic information collected 
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to play a 3D stereoscopic video signal such as whether 
display hardware of the user terminal is monoscopic or 
stereoscopic or whether a video decoder is a stereoscopic 
MPEG-2, stereoscopic MPEG-4 or stereoscopic audio video 
interleave (AVI) video decoder, or whether a rendering 
method is interlaced, sync-double, page-flipping, red-blue 
anaglyph, red-cyan anaglyph, or red-yellow anaglyph. 

The video adaptation portion 103 includes a video 
metadata adaptation unit 213 and a video content adaptation 
unit 215. 

The video content adaptation unit 215 parses the user 
characteristic information and the video terminal 
characteristic information that are managed in the user 
characteristic information input unit 217 and the video 
terminal characteristic information managing unit 2 09, 
respectively, and then adapts the video content suitably to 
the user characteristics and the terminal characteristics. 

That is, the video content adaptation unit 215 
receives and parses the user characteristic information. 
Then, the user preference such as depth, parallax and the 
number of maximum delay frames are reflected in an 
adaptation signal processing process and the 2D video 
content is converted to the 3D stereoscopic video content. 

Also, when the inputted 3D stereoscopic video signal 
is converted to the 2D video signal, left image, right 
image or synthesized image of the inputted 3D stereoscopic 
video signal is reflected and the 3D stereoscopic video 
signal is adapted to the 2D video signal according to the 
preference information of user. 

Also, the video content adaptation unit 215 receives 
the user characteristic information in an XML format from 
the video terminal characteristic information managing unit 
2 09 and parses the user characteristic information. Then, 
the video content adaptation unit 215 executes adaptation 
of the 3D stereoscopic video signal according to the user 
terminal characteristics information such as kinds of 
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display device, 3D stereoscopic video decoder and rendering 
method. 

The video metadata adaptation processing unit 213 
provides metadata needed in the video content adaptation 
5 process to the video content adaptation unit 215, and 
adapts the content of corresponding video metadata 
information based on the result of video content adaptation. 

That is, the video metadata adaptation processing unit 
213 provides metadata needed in the 2D video content or 3D 
10 stereoscopic video adaptation process to the video content 
adaptation unit 215. Then, the video metadata adaptation 
processing unit 213 updates, writes or stores 2D video 
metadata or 3D stereoscopic video metadata based on the 
result of video content adaptation. 
15 The video content/metadata output unit 105 outputs 

contents and metadata of 2D video or 3D stereoscopic video 
adapted according to the user characteristic and the 
terminal characteristic. 

Fig. 3 is a flowchart illustrating a video adaptation 
20 process performed in the video adaptation apparatus of Fig. 
1. Referring to Fig. 3, at step S301, the video usage 
environment information managing portion 107 acquires video 
usage environment information from a user and a user 
terminal, and prescribes information on user 
25 characteristics, user terminal characteristics. 

Subsequently, at step S303, the video * data source 
portion 101 receives video content/metadata. At step S3 05, 
the video adaptation portion 103 adapts the video 
content/metadata received at the step S303 suitably to the 
30 usage environment, i.e., user characteristics, user 
terminal characteristics, by using the usage environment 
information described at the step S301. 

At step S307, the video content/metadata output 
portion 105 outputs 2D video data or 3D stereoscopic video 
35 adapted at the step S3 05. 

Fig. 4 is a flowchart depicting the adaptation process 
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(S305) of Fig. 3. 

Referring to Fig. 4, at step S401, the video 
adaptation portion 103 identifies 2D video content or 3D 
stereoscopic video content and video metadata that the 
video data source portion 101 has received. At step S403, 
the- video adaptation portion 103 adapts the 2D video 
content or 3D stereoscopic video content that needs to be 
adapted suitably to the user characteristics, natural 
environment of the user and user terminal capability. At 
step S405, the video adaptation portion 103 adapts the 
video metadata corresponding to the 2D video content or 3D 
stereoscopic video content based on the result of the video 
content adaptation, which is performed at the step S403. 

Fig. 5 is a flowchart showing an adaptation process of 
2D video signal and 3D stereoscopic video signal in 
accordance with a preferred embodiment of the present 
invention. 

Referring to Fig. 5, a decoder 502 receives an encoded 
MPEG video signal 501, extracts motion vector from each 
16x 16 macro block and executes image type analysis 503 and 
motion type analysis 504. 

During the image type analysis, it is determined 
whether an image is a static image, a horizontal motion 
image, a non-horizontal motion image or a fast motion image. 

During the motion type analysis, motion of camera and 
an object of the moving image are determined. 

3D stereoscopic video 505 is generated from 2D video 
by the image type analysis 5 03 and the motion type analysis 
504. 

An image pixel or 3D depth information of a block is 
obtained from the static image based upon intensity, 
texture and other characteristics. The obtained depth 
information is used to construct a right image or a left 
image . 

A current image or a delayed image is chosen from the 
horizontal motion image. The chosen image is suitably 



14 



WO 2004/008768 



PCT/KR2003/001411 



displayed to a right or left eye of the user according to a 
motion type of the horizontal motion image determined by 
the motion type analysis 504. 

A stereoscopic image is generated from the non- 
5 horizontal motion image according to the motion and the 
depth information 

Herein, a structure of description information that is 
managed in the video usage environment information managing 
portion 107 is described* 
10 m accordance with the present invention, in order to 

adapt a 2D video content or 3D stereoscopic video content 
to usage environment by using pre-described information of 
usage environment where the 2D video content " or 3D 
stereoscopic video content is consumed, usage environment 
15 information, e.g., the information 

StereoscopicVideoConversionType on the user characteristics, 
the information StereoscopicVideoDisplayType on the 
terminal characteristics should be managed. 

The information on the user characteristics describes 
20 user preference on the 2D video or 3D stereoscopic video 
conversion. Shown below is an example of syntax that 
expresses a description information structure of the user 
characteristics which is managed by the video usage 
environment information managing portion 107, shown in Fig. 
25 1, based on the definition of the XML schema. 

<complexType name= n StereoscopicVideoConversionType M > 
<sequence> 
<element 

30 name="From2DTo3DStereoscopic M minOccurs="0 "> 

<complexType> 
<seguence> 

<element name= ,, ParallaxType n > 
<simpleType> 
35 <restriction base=" string"> 

Enumeration value= "Positive" /> 
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<enumeration value= ,f Negative" /> 

</restriction> 

</simpleType> 

</element> 

<element 

name="DepthRange M type="mpeg7 : zeroToOneType" /> 
<element 

name= H MaxDelayedFrame " 
type= " nonNegativelnteger " /> 
</sequence> 
</complexType> 
</element> 
<element 

name= 1< From3DStereoscopicTo2D" minOccurs=" 0 "> 
<complexType> 
<sequence> 

<element name="Lef tRightInterVideo"> 
< s imp 1 e Ty pe> 

<rest.rict.ion base=" string n > 
<enumeration value=."Lef t"/> 
<enumeration value** " Right " /> 
<enumeration valuer " Intermediate " /> 
</restriction> 
< / s imp leType> 
</element> 
</sequence> 
< / c omp 1 exTy pe> 
</element> 
</sequence> 
</complexType> 

Table 1 shows elements of user characteristics. 
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[Table 1] 





Elements 


Data tvoe 




Parallax Type 


String; 

Positive or Negative 


Stereoscopic 
Video Conversion 
Type 


Depth Range 


Mpeg7 : zeroToOneType 


Max Delayed Frame 


Nonnegative Integer 




Left Right Inter 
Video 


String; Left, Right, 
Intermediate 



Referring to the exemplary syntax described by the 
definition of an XML schema , the user characteristics of 
the present invention are divided into two categories such 
as a conversion case from 2D video to 3D stereoscopic video 
From2DTo3DStereoscopic and a conversion case from 3D 
stereoscopic video to 2D video From3DStereoscopicTo2D. 

In case of the conversion from 2D video to 3D 
stereoscopic video, the PrallaxType represents negative 
parallax or positive parallax which is the user preference 
to the type of parallaxes. 

Fig. 6 is an exemplary diagram depicting parallaxes in 
accordance with the present invention. 

Referring to Fig. 6, A represents the negative 
parallax and B represents the positive parallax. That is, 
the 3D depth of objects, i.e., three circles, is perceived 
between the monitor screen and human eyes in case of the 
negative parallax and the objects are perceived behind the 
screen in case of the positive parallax. 

Also, in case of conversion from a 2D video signal to 
a 3D stereoscopic video signal, DepthRange represents a 
user preference to the parallax depth of the 3D 
stereoscopic video signal. The parallax can be increased 
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or decreased according to determination of the range of 3D 
depth . 

Fig. 7 is an exemplary diagram depicting range of 
depth in accordance with the present invention. 

5 Referring to Fig. 7, at a convergence point A, the 

wider depth is perceived compared with B. 

Also, in case of conversion from a 2D video signal to 
a 3D stereoscopic video signal, MaxDelayedFrame represents 
the maximum number' of delayed frames. 

10 One of the stereoscopic conversion schemes is to make 

use of a delayed image. That is f the image sequence is {... , 
lk-3r lk-2r lk-if Ikf — } and I k is the current frame. One of 
the previous frames, Ik- n (n>l) is chosen. Then, a 

stereoscopic image consists of I k and Ik-n- the maximum 

15 number n of delayed frames is determined by MaxDelayedFrame. 

In case of conversion from a 3D stereoscopic video 
signal to a 2D video signal, Lef tRightlnterVideo represents 
a user preference among left image, right image or 
synthesized image in order to obtain an image having better 

20 quality. 

The information on the user terminal characteristics 
represents characteristics information such as whether 
display hardware of the user terminal is monoscopic or 
stereoscopic or whether a video decoder is a stereoscopic 

25 MPEG-2, stereoscopic MPEG-4 or stereoscopic AVI video 
decoder, or whether a rendering method is interlaced, sync- 
double, page-flipping, red-blue anaglyph, red-cyan anaglyph, 
or red-yellow anaglyph. 

Shown below is an example of syntax that expresses a 

30 description information structure of the user terminal 
characteristics which is managed by the video usage 
environment information managing portion 107, shown in Fig. 
1, based on the definition of the XML schema. 

35 <complexType name= " StereoscopicVideoDisplayType " > 

<sequence> 
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<element name="DisplayDevice"> 
<simpleType> 

<restriction base=" string u > 

<enumeration value="Monoscopic , 7> 

<enumer at ion value= " Stereoscopic ■■ /> 

</restriction> 

</simpleType> 

</element> 

<element name= " StereoscopicDecoderType 11 

type= "mpeg7 : Control ledTermUseType " /> 
<element name= " RenderingFormat " > 
<simpleType> 

^restriction base= 11 string "> 
<enumeration value=" Interlaced" /> 
<enumer at ion value= " Sync-Double " /> 
<enumer ation value= "Page-Flipping " /> 
<enumeration value= 11 Anaglyph— Red— Blue " /> 
<enumeration value=" Anaglyph-Red-Cyan " /> 
<enumeration value=" Anaglyph-Red- Yellow" /> 
</restriction> 
</simpleType> / 
</element> 
</sequence> 
</complexType> 

Table 2 shows elements of user characteristics. 



[Table 2] 





Elements 


Data type 


StereoscopicVid 
eoDisplayType 


Display Type 


String 


StereoscopicDecoderType 


Mpeg7 : ControlledTer 
mUseType 




Rendering Format 


String 
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DisplayType represents whether display hardware of the 
user terminal is monoscopic or stereoscopic. 

StreoscopicDecoderType represents whether the video 
5 decoder is a stereoscopic MPEG-2 , stereoscopic MPEG-4 or 
stereoscopic AVI video decoder 

RenderingFormat represents whether the video decoder 
is a stereoscopic MPEG-2 r stereoscopic MPEG-4 or 
stereoscopic AVI video decoder, or whether a rendering 
10 method is interlaced, sync-double, page-flipping, red-blue 
anaglyph, red-cyan anaglyph, or red-yellow anaglyph. 

Figs. 8A to 8C are exemplary diagrams illustrating 
rendering methods of 3D stereoscopic video signal in 
accordance with the present invention. Referring to Figs. 
15 8A to 8C, the rendering methods include interlaced, syn- 
Double and page-flipping. 

Shown below is an example of syntax that expresses a 
description information structure of the user 
characteristics such as preference and favor of user when 
20 2D video signal is adapted to a 3D stereoscopic video 
signal. 

The syntax expresses that PrallaxType represents 
Negative Parallax, 'DepthRange is set to 0.7 and the maximum 
number of delayed frames is 15. 
25 Also, the syntax expresses that the synthesized image 

is chosen among 3D stereoscopic video signals. 



<StereoscopicVideoConversion> 
<From2DTo3DStereoscopic> 
3 0 . <ParallaxType>Negative</ParallaxType> 

<DepthRange>0 . 7</DepthRange> 
<MaxDelayedFrame>15</MaxDelayedFrame> 
</From2DTo3DStereoscopic> 
<From3DStereoscopicTo2D> 

35 

<Lef tRightInterVideo>Intermediate</LeftRightInterVide 
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o> 

</From3DStereoscopicTo2D> 
</StereoscopicVideoConversion> 

Shown below is an example of syntax that expresses a 
description information structure of the user terminal 
characteristics in case of a 3D stereoscopic video signal 
user terminal . 

The user terminal supports a monoscopic display, an 
MPEG-1 video decoder and anaglyph. These user terminal 
characteristics are used for 3D stereoscopic video signal 
users. 

<StereoscopicVideoDisplay> 
<DisplayDevice>Monoscopic</DisplayDevice> 
•<StereoscopicDecoderType 

href="urn :mpeg:mpeg7 : cs : VisualCodingFormatCS : 2 0 01 : 1 "> 
<mpeg7:Name xml : lang= H en n >MPEG-l Video 
</mpeg7 :Name> 
</StereoscopicDecoderType> 

<RenderingFormat>Anaglyph</RenderingFormat> 
</StereoscopicVideoDisplay> 

The method of the present invention can be stored in a 
computer-readable recording medium, e.g., a CD-ROM, a RAM, 
a ROM, a floppy disk, a hard disk, and- an optical/magnetic 
disk. 

As described above, the present invention can provide 
a service environment that can adapt a 2D video content to 
a 3D stereoscopic video content and a 3D stereoscopic video 
content to a 2D video content by using information on 
preference and favor of a user and user terminal 
characteristics in order to comply with different usage 
environments and characteristics and preferences of the 
user. 
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Also, the technology of the present invention can 
provide one single source to a plurality of usage 
environment by adapting the 2D video signal or 3D 
stereoscopic video content to different usage environments 
and users with various characteristics and tastes. 
Therefore, the cost for producing and transmitting a 
plurality of video contents can be saved and the optimal 
* video contents service can be provided by satisfying the 
preferences of user and overcoming limitation of user 
terminal capabilities .While the present invention has been 
shown and described with respect to the particular 
embodiments, it will be apparent to those skilled in the 
art that many changes and modifications may be made without 
departing from the spirit and scope of the invention as 
defined in the appended claims. 
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